Method of selecting among N "Spatial Video CODECs"
the optimum "Spatial Video CODEC" for a same input signal I

The present invention relates to a method of selecting among N "Spatial Video
CODECs", where N is an integer number greater than 1, the optimum "Spatial
Video CODEC" for a same input signal I. In this new technique (hereafter
referred to as "Dynamic Coding") for digital video coding, "Spatial Video
CODEC" is understood as the combination of any transform of the input signal,
followed by a quantization of the transform coefficients and a corresponding
entropic coder.

Video coding is an important issue in all application fields where digital
video information has to be stored on a digital support or transmitted over
digital networks. Several solutions have been proposed in the last 20 years,
and standardization efforts have been undertaken to define a unified syntax.

Standard video coding schemes have a rigid structure. They take into account
the context of specific, well-defined applications requiring video coding, and
propose an optimized, albeit limited, solution. This explains the number of
existing international recommendations that have been defined for specific
applications. For example, the ITU-T H.261 standard is designed for
tele-conferencing and video-telephony applications, MPEG-1 for storage on
CD-ROM, MPEG-2 for wide-band TV broadcast, MPEG-4 for low-bitrate coding with
multimedia functionalities and H.264 for very low bit-rate video coding.

The strategy adopted by classical video coding schemes is prompted by the fact
that no single universal coding technique can be applied with optimum results
in every context. In fact, the performance of a "Spatial Video CODEC" depends
on several application-specific parameters, such as: the type of the data to be
compressed (still pictures, video, stereo imagery, and so on), the nature of
the visual data (natural, synthetic, text, medical, graphics, hybrid), the
target bitrate, the maximum acceptable delay (ranging from a few milliseconds
to off-line), the minimum acceptable quality (spatial and temporal resolution),
the type of communication (point to point, broadcast, multipoint to point,
etc.), and the set of functionalities required (scalability, progressiveness,
etc.). Often these parameters, such as the nature of the input signal or the
available bandwidth, may change in time, with a consequent variation of the
performance of the selected "Spatial Video CODEC". In the following table,
major specifications for a few of the most critical applications for video
coding are listed. Read "Mbps" as megabits per second, "Kbps" as kilobits per
second, "fps" as frames per second, "MP2MP" as multipoint to multipoint, "P2P"
as point to point, and "P2MP" as point to multipoint.





Application         Type of data              Nature of data            Target bitrate  Max delay  Minimum quality    Type of comm.  Functionalities
------------------  ------------------------  ------------------------  --------------  ---------  -----------------  -------------  -------------------------
Video surveillance  Low motion video          Natural                   3-6 Mbps        300 ms     25-50 fps, FULL    MP2MP          Fast frame access
Video telephony     Static video              Natural                   <500 Kbps       200 ms     10 fps, QCIF       P2P            Scalability
Telemedicine        Video, still pictures     Hybrid, text              >5 Mbps         <1 sec     50 fps, FULL       MP2MP          Scalability, editing, ...
Digital TV          High motion video         Natural, text, synthetic  <1 Mbps         <1 sec     25 fps, FULL       P2MP           Record, fast frame access
Corporate TV        Natural, text, synthetic  Natural, text, synthetic  2 Mbps          <1 sec     50 fps, FULL       P2MP           Scalability
Video conference    Natural, text             Natural, text, synthetic  512 Kbps        <1 sec     25 fps, QCIF-CIF   MP2MP          Scalability

Given the wide variations in the requirements from application to application,
it is clear that a coding scheme tuned for a specific application will be
superior, for that application, to a universal coding technique that attempts
to find a suitable compromise among different constraints. However, even the
optimum "Spatial Video CODEC" for a specific set of constraints may be a
sub-optimal solution when the parameters of the application are allowed to
change through time. For example, in several multimedia scenarios, the video
input combines static scenes with highly dynamic ones. Moreover, the sequences
may be natural images, synthetic scenes, or a combination of the two (text,
graphs, and natural images), with a consequent variation of the statistical
properties of the input signal.

The present invention proposes a method which is a suitable solution to achieve
an optimum video coding despite the changes of the above discussed properties
of the input signal.

The method is defined by the characterizing portion of claim 1, while the
dependent claims propose various embodiments.

The newly proposed paradigm for video coding is based on the following idea:
dynamically evaluate the performance of several coders given the input data and
the external constraints of the application. This evaluation is performed on a
block basis. The input image can be organized as a set of rectangular blocks
whose size can range from 2 by 2 pixels to the size of the input image. The
results of such an evaluation are used to select the best performing among the
available "Spatial Video CODECs" for that given input data. A prototype
implementing this strategy has proved its superiority against standard
approaches.


The invention will now be disclosed with the help of the accompanying drawings. 

Figure 1 shows the general structure of any video coding scheme.

Figure 2 shows an MPEG-like simplified video coding scheme.

Figure 3 shows how the proposed "Dynamic Coding" fits in a standard video
coding scheme.

Figure 4 shows the scheme of the Dynamic Coding base block.

Figure 5 shows examples of four different Normalized "Spatial Video CODECs".

Figure 6 shows a statistical evaluation of the "Spatial Video CODEC" alignment
error.

Figure 7 shows a standard procedure to compute the Rate and the Distortion of
the "Spatial Video CODEC".

Figure 8 shows a proposed Rate Distortion prediction scheme.

Figure 9 shows a performance comparison of three "Spatial Video CODECs" on a
frame basis.

Figure 10 shows the frequency of the selection of a "Spatial Video CODEC"
during the encoding of a typical 10-minute TV sequence.

In order to efficiently encode any digital signal it is important to exploit
all the redundancies that the signal contains. When the input signal is a video
signal there are three different kinds of redundancy that can be exploited: the
spatial, the temporal and the statistical redundancy. All existing video coding
schemes apply different techniques to exploit, to a greater or lesser extent,
all these redundancies. Based on these observations, the general scheme of any
video coding is represented in Figure 1. Each of the three main modules can be
performed differently. As already discussed, the overall performance of any
"Spatial Video CODEC" depends on the nature of the input signal, on the
constraints of the application and clearly on the combination of the right
tools for each of the three modules represented in Figure 1.

Because of practical constraints, it is not possible to evaluate all the
possible solutions. Thus the approach adopted by the standards is to fix one
combination that provides the best compromise in some specific scenario. This
constrains the efficiency of the "Spatial Video CODEC", but simplifies its
implementation. What is lacking in order to dynamically adapt the scheme to the
external constraints is an efficient prediction of the performance of each
tool.

We will explain how it is possible to evaluate several tools to dynamically
select the one that optimally exploits the spatial redundancy of the signal, by
taking into account the properties of the previous tool used to exploit the
temporal redundancy and the following tool used to exploit the statistical
redundancy. This evaluation is fast and efficient and boosts the coding
performance of standard approaches.

Hereafter, we refer to both the spatial and entropic redundancy modules as
"Spatial Video CODEC". In this document, as previously mentioned, "Spatial
Video CODEC" is understood as the combination of any transform of the input
signal, followed by a quantization of the transform coefficients and a
corresponding entropic coder.

A simplified version of an MPEG-like standard video encoder is shown in
Figure 2, where the temporal redundancy is exploited with a block-matching
approach (motion estimation and motion compensation) and the spatial redundancy
is exploited by the DCT. Standard video coding schemes use the block-based DCT
transform in order to exploit the spatial redundancy present in the signal. The
input signal is a natural image or a difference of two natural images. The
input is divided into square blocks and each block is DCT transformed. The
resulting coefficients are quantized to increase their redundancy. These
quantized coefficients are encoded with a statistical coder and the results
represent the output bitstream. By de-quantizing (scaling back) the quantized
coefficients and by applying the inverse DCT transform, the decoded frame is
obtained. This is stored and used as a reference to predict the following
frames of the input sequence.

This scheme is controlled by a so-called "rate-distortion" algorithm. This is a
mechanism that controls the rate and/or the distortion of the video coding
process. In particular, it is possible to define an approximate rate, and the
"Spatial Video CODEC" will adapt the quantization step so as to provide the
minimum distortion for the given rate. Note that the distortion may vary
through time according to the complexity of the input scene and the performance
of the "Spatial Video CODEC". However, the rate of the compressed sequence will
remain approximately constant, which is useful to stream video content over a
band-limited network. When a live or single-pass encoding is necessary, the
"rate-distortion" algorithm must control the "Spatial Video CODEC" through a
feedback loop. The results of the previously encoded frames are used to predict
the difficulty of encoding the following frame. Given this prediction, the
"rate-distortion" algorithm fixes the quantization step so as to approximate
the requested rate. The rate-distortion algorithm may also fix the distortion
(or the quality) of the encoded video while the rate is free to vary in time.
In this case, a fixed quantization associated with the requested distortion is
usually defined and left unchanged along the sequence. Clearly, a combination
of the two objectives is also possible.



CK/l.V474.12EP.2Dpt

The technique proposed in this document can be applied in the same context as
the one described above. The scheme is the same as the one in Figure 2, except
for the spatial redundancy module, which is now replaced by the "Dynamic
Coding" module as depicted in Figure 3. The input frame can be a natural image,
i.e. an intra frame (I frame), or the difference between two (or more) natural
images, i.e. a predicted frame (P or B frame). The dynamic coding scheme can be
applied either on the whole input, or independently on any rectangular
sub-block of the input. The size of any sub-block may vary from 2x2 pixels to
the size of the frame itself. The sub-blocks do not overlap, but together they
cover the whole frame.
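
The partition into non-overlapping sub-blocks that together cover the frame can be sketched as follows. This is only an illustration: the function name `tile_frame`, the QCIF-sized example frame and the choice to clip blocks at the frame border are assumptions, not details given in the text.

```python
from typing import Iterator, Tuple

Block = Tuple[int, int, int, int]  # (top, left, height, width)

def tile_frame(height: int, width: int, block_h: int, block_w: int) -> Iterator[Block]:
    """Yield non-overlapping rectangles that together cover the whole frame."""
    if block_h < 2 or block_w < 2:
        raise ValueError("the text gives 2x2 pixels as the smallest sub-block")
    for top in range(0, height, block_h):
        for left in range(0, width, block_w):
            # Assumption: blocks on the right and bottom borders are clipped,
            # so the tiling never extends beyond the frame.
            yield (top, left, min(block_h, height - top), min(block_w, width - left))

# Hypothetical 176x144 frame split into 16x16 blocks.
blocks = list(tile_frame(height=144, width=176, block_h=16, block_w=16))
covered = sum(h * w for _, _, h, w in blocks)
print(len(blocks), covered == 144 * 176)
```

Each yielded rectangle can then be passed to the Dynamic Coding block on its own, which is what allows a different "Spatial Video CODEC" to be chosen per sub-block.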


In order to optimally exploit the spatial redundancies, it is possible to
evaluate not only one single transform (as the DCT in the standard approaches)
associated with a unique quantization and entropic coder, but an arbitrary
number of other "Spatial Video CODECs" composed of any possible transform with
corresponding quantization and entropic coder. As in a standard approach, a
rate-distortion algorithm is used to provide to the Dynamic Coding block an
indication of the expected distortion and rate. In our implementation, this
indication comes in the form of a quality parameter Q that defines the quality
of the encoded frames. When the quality parameter Q is fixed, the rate will
vary according to the statistical properties of the input signal; otherwise, it
is possible to choose the right Q in order to provide a constant bitrate coded
video stream.
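
One way to picture the second mode, choosing Q so that the bitrate stays near a target, is a search over the allowed range of Q. The sketch below is not the rate-distortion algorithm of the text, which is left unspecified. It assumes a hypothetical callable `rate_at(Q)` that is non-decreasing in Q, and the linear rate model in the example is invented for illustration.

```python
from typing import Callable

def choose_quality(rate_at: Callable[[int], float], target_bits: float) -> int:
    """Return the largest Q in [0, 100] whose rate does not exceed target_bits.

    Assumes rate_at(Q) is non-decreasing in Q (higher quality costs more bits).
    Returns 0 if even the lowest quality exceeds the target.
    """
    lo, hi = 0, 100
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if rate_at(mid) <= target_bits:
            lo = mid        # mid fits the budget, try a higher quality
        else:
            hi = mid - 1    # mid is too expensive, lower the quality
    return lo

# Hypothetical rate model: 400 bits at Q = 0, growing by 90 bits per unit of Q.
print(choose_quality(lambda q: 400.0 + 90.0 * q, target_bits=5000.0))  # 51
```

In the constant-quality mode no search is needed: Q is simply held fixed and the rate is left free to vary.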

The basic "Dynamic Coding" block is illustrated in Figure 4, where the input
can be either an I frame or a P/B frame, or any rectangular sub-block of the I
or P/B frame. In Figure 4, each SC_n block represents a different "Spatial
Video CODEC". In this scheme, N "Spatial Video CODECs" (SC_n) are evaluated.
Each SC_n, given its input and the quality parameter Q, returns a bitstream B_n
and the corresponding distortion D_n and rate R_n. According to these two
values, a decision is made on which SC_n has the best performance, and it will
be selected to actually encode the input.

In this document we propose a new procedure that is able to efficiently compare
the performance of different "Spatial Video CODECs" in two steps: Normalization
and Evaluation. The normalization step is performed offline, while the
evaluation step is performed on the output of each SC_n. In the normalization
step, all the "Spatial Video CODECs" that are to be evaluated are aligned in
terms of the quality parameter Q. In the evaluation step, the rate-distortion
performance of each normalized "Spatial Video CODEC" is predicted and the one
with the best rate-distortion ratio is selected. The exact normalization and
evaluation procedures are detailed hereafter.

The normalization step requires the definition of a quality parameter Q. The
parameter Q must have a minimum value and a maximum value. In the preferred
implementation, Q is any integer from 0 to 100. Q is the result of a
rate-distortion algorithm that controls the encoding process to obtain one of
the following:

1) a constant quality compression (in this case the distortion introduced by
   the coding process remains constant while the rate may vary along the
   sequence).

2) a constant rate compression (in this case, the rate of the encoded sequence
   remains constant while the quality/distortion may vary along the
   sequence).

All the "Spatial Video CODECs" are normalized or "aligned" as a function of the
same parameter Q (quality) so as to provide the same distortion for the same
value of Q, i.e. for the same input and the same Q parameter, all normalized
"Spatial Video CODECs" should provide a compressed frame with similar
distortion, but possibly differing rate.

In the proposed implementation, the "Spatial Video CODECs" are aligned
according to the MSE (Mean Square Error) distortion measure. The alignment is
performed by defining the following relationship between Q and the MSE
distortion measure:

\[ \mathrm{MSE} = \frac{f(Q)^2}{12} \qquad (1) \]

where f(Q) is a function of the quality parameter Q. As described by Mallat in
"Analysis of Low Bit Rate Image Transform Coding" (in IEEE Transactions on
Signal Processing, Vol. 46, No. 4, April 1998), for Spatial CODECs using a
uniform quantization, the relationship between the quality parameter Q and the
quantization step Δ can be expressed as: f(Q) = Δ. In case of non-uniform
quantization, the relationship between MSE and Q has to be respected for each
Q.

In a preferred implementation, in order to perform the normalization, the
"Spatial Video CODECs" are uniformly quantized with a step Δ defined as:

\[ \Delta = 2^{\,C_1 - Q/C_2} \qquad (2) \]

By combining equations (1) and (2) we obtain that the distortion expressed as
the MSE is a function of Q defined by:

\[ \mathrm{MSE} = \frac{\Delta^2}{12} = \frac{\left(2^{\,C_1 - Q/C_2}\right)^2}{12} \qquad (3) \]

where C1 controls the minimal and maximal quality and C2 the variation of the
distortion according to Q. In particular the following values have been chosen:
C1 = 5 and C2 = 24. This means that the quantization step Δ doubles, and the
MSE of equation (3) therefore quadruples, for each decrease of 24 in the Q
parameter.
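
As a small numerical check of this normalization, the sketch below evaluates the step Δ = 2^(C1 - Q/C2) and the target distortion MSE = Δ²/12 (the forms given in claim 6) for a few values of Q, using the stated values C1 = 5 and C2 = 24. The function names are illustrative only.

```python
C1 = 5.0   # value given in the text
C2 = 24.0  # value given in the text

def quant_step(q: float, c1: float = C1, c2: float = C2) -> float:
    """Quantization step for a quality parameter q in [0, 100]."""
    if not 0 <= q <= 100:
        raise ValueError("Q is defined between 0 and 100")
    return 2.0 ** (c1 - q / c2)

def target_mse(q: float, c1: float = C1, c2: float = C2) -> float:
    """Distortion every normalized codec is expected to produce at q."""
    return quant_step(q, c1, c2) ** 2 / 12.0

for q in (0, 24, 48, 72, 96):
    print(q, quant_step(q), round(target_mse(q), 4))
```

With these constants the step runs from 32 at Q = 0 down to 2 at Q = 96. Lowering Q by 24 doubles the step and multiplies the target MSE by four.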

The proposed normalization procedure is not exact, but as we show in Figure 5,
it is possible to obtain on different input signals a similar distortion for a
given quality parameter Q, and this for different "Spatial Video CODECs". Four
transforms are displayed: the Wavelet 5/3, Wavelet Haar, DCT and DPCM. For a
given Q parameter, they introduce a similar distortion.

Moreover, we claim that the error ε_n in the alignment of the n-th "Spatial
Video CODEC" is small compared to the predicted distortion:

\[ D_n = \mathrm{MSE}(Q) + \varepsilon_n, \qquad |\varepsilon_n| \ll |D_n| \qquad (4) \]

We have statistically evaluated equation (4) and the performance of the
proposed "Spatial Video CODEC" alignment, and we report the results in
Figure 6. The computed and predicted distortions are compared with the
following relative measure:

\[ e_n = \frac{D_n - \mathrm{MSE}}{\mathrm{MSE}} \qquad (5) \]

where the real distortion measure is D_n and the predicted distortion measure
is MSE. It turns out that most (>98% of the evaluated tests) of the predicted
distortions introduce an approximation below 20% of the exact distortion.
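
The relative measure (D_n - MSE) / MSE can be tried out on synthetic data. The sketch below is not the statistical evaluation reported in Figure 6: it assumes a plain uniform quantizer applied directly to uniformly distributed random samples, and only illustrates how the measure is computed and why Δ²/12 is a reasonable prediction in that case.

```python
import random

def uniform_quantize(x: float, step: float) -> float:
    """Round x to the nearest multiple of step (uniform quantizer)."""
    return step * round(x / step)

def relative_alignment_error(samples, step: float) -> float:
    """(D_n - MSE) / MSE, with MSE = step**2 / 12 as the predicted distortion."""
    measured = sum((x - uniform_quantize(x, step)) ** 2 for x in samples) / len(samples)
    predicted = step ** 2 / 12.0
    return (measured - predicted) / predicted

rng = random.Random(0)  # fixed seed so the run is repeatable
# Assumption: synthetic samples stand in for transform coefficients.
samples = [rng.uniform(-1000.0, 1000.0) for _ in range(20000)]
err = relative_alignment_error(samples, step=8.0)
print(f"relative error: {err:+.3%}")
```

Real transform coefficients are far from uniformly distributed, which is one reason the alignment of actual codecs is only approximate.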

The evaluation step requires the computation of the rate R and of the
distortion D for each Normalized "Spatial Video CODEC", given a selected
quality parameter Q and the current input block to be coded.

According to these values, a decision is made on which "Spatial Video CODEC"
has the best rate-distortion performance. This decision may be taken
independently on the rate or on the distortion values. In the first case, the
"Spatial Video CODEC" with the minimum rate will be selected; in the second,
the "Spatial Video CODEC" with the minimum distortion will be selected.
However, a better decision is obtained if both the rate and the distortion are
taken into account. This is possible by applying a Lagrangian optimization of
the two values:

\[ L_n = R_n + \lambda D_n \qquad (6) \]

with n ∈ [1, N] representing the index of an evaluated "Spatial Video CODEC"
over the N total number of "Spatial Video CODECs" and λ representing the
Lagrangian multiplier that provides the relative weight between rate and
distortion in selecting the best compromise. In this context, the selection of
the best "Spatial Video CODEC" is done by choosing the one with the minimum
L_n.
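
The selection rule, choosing the codec with the smallest L_n = R_n + λ D_n (the form given in claim 2), can be sketched in a few lines. The codec names and the (rate, distortion) pairs below are hypothetical values chosen only to show how the weight λ changes the outcome.

```python
from typing import Dict, Tuple

def select_codec(measures: Dict[str, Tuple[float, float]], lam: float) -> str:
    """Return the codec whose cost L_n = R_n + lam * D_n is smallest.

    measures maps a codec name to its (rate, distortion) pair.
    """
    if not measures:
        raise ValueError("at least one codec must be evaluated")
    return min(measures, key=lambda name: measures[name][0] + lam * measures[name][1])

# Hypothetical (rate in bits, distortion as MSE) pairs for one block.
measures = {"dct": (1200.0, 20.0), "haar": (900.0, 26.0), "dpcm": (1500.0, 19.0)}
print(select_codec(measures, lam=10.0))   # haar: the rate term dominates
print(select_codec(measures, lam=100.0))  # dct: distortion weighs more
```

Setting λ to 0 reduces the rule to the minimum-rate decision, and a very large λ approaches the minimum-distortion decision, the two special cases mentioned in the text.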

In order to perform a Lagrangian optimization, it is necessary to compute the
rate, the distortion and the optimal λ for each "Spatial Video CODEC". In the
following we describe the procedure adopted in this invention.

The Lagrange multiplier is responsible for weighting the contribution of the
rate and of the distortion in the overall evaluation of the performance of a
"Spatial Video CODEC". Its value should be computed according to the
rate-distortion curves of each "Spatial Video CODEC" as a function of a given
input. This is clearly too computationally expensive to be considered in a
practical application. In our approach we find an approximation for λ that is a
function of the quality parameter Q. The starting point is the high-rate model
proposed by Mallat. This model states that

\[ D = k \, 2^{-2R} \qquad (7) \]

where k is a constant depending on the "Spatial Video CODEC" and the input
signal. From equation (7) we define the relationship between rate and
distortion as:

\[ R = \frac{1}{2} \log_2\!\left(\frac{k}{D}\right) \qquad (8) \]

Merging (8) and (6) we obtain:

\[ L = \frac{1}{2} \log_2\!\left(\frac{k}{D}\right) + \lambda D \qquad (9) \]

We can now minimize L as a function of the distortion D only. To do this, we
differentiate with respect to D and find the roots:

\[ \frac{dL}{dD} = \lambda - \frac{1}{2 \ln(2)\, D} = 0 \qquad (10) \]

The final solution is:

\[ \lambda = \frac{1}{2 \ln(2)\, D} \qquad (11) \]

If the model of distortion of equation (3) is assumed valid, equation (11) may
be written as:

\[ \lambda = \frac{12}{2 \ln(2)\, \Delta^2} = \frac{6}{\ln(2)} \, 2^{-2 (C_1 - Q/C_2)} \qquad (12) \]

Equation (12) defines the optimum λ as a function of the quality parameter Q.
Referring to the scheme of Figure 4, the single parameter Q controls both the
encoding performance of each "Spatial Video CODEC" and the Lagrange multiplier
λ. This is the main interest and novelty of this approach: given Q, it is
possible to measure R and D for each Normalized "Spatial Video CODEC" and at
the same time to compute the corresponding λ. The selected Normalized "Spatial
Video CODEC" is the one that minimizes L in equation (6).
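
The dependence of the Lagrange multiplier on Q can be checked numerically. The sketch below uses the form given in claim 5, λ = 1 / (2 ln(2) MSE), together with MSE = (2^(C1 - Q/C2))² / 12. It then verifies that, under an assumed high-rate model R = 0.5 log2(k / D), the cost R + λD is smallest at D = MSE. The constant k = 1000 is an arbitrary illustrative value and does not affect where the minimum lies.

```python
import math

def lagrange_multiplier(q: float, c1: float = 5.0, c2: float = 24.0) -> float:
    """lambda = 1 / (2 ln(2) MSE) with MSE = (2 ** (c1 - q / c2)) ** 2 / 12."""
    mse = (2.0 ** (c1 - q / c2)) ** 2 / 12.0
    return 1.0 / (2.0 * math.log(2.0) * mse)

def cost(d: float, lam: float, k: float = 1000.0) -> float:
    """L(D) = 0.5 * log2(k / D) + lam * D under the assumed high-rate model."""
    return 0.5 * math.log2(k / d) + lam * d

q = 48
lam = lagrange_multiplier(q)
mse = (2.0 ** (5.0 - q / 24.0)) ** 2 / 12.0
# The cost at D = MSE should not exceed the cost slightly to either side.
print(cost(mse, lam) <= cost(0.9 * mse, lam), cost(mse, lam) <= cost(1.1 * mse, lam))
```

Because λ depends only on Q, it can be computed once per value of Q and shared by all the codecs being compared.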

Given the parameter Q and the input I, computing R and D can be performed in a
precise, but computationally expensive way by applying the "Spatial Video
CODEC" to the input and measuring the size of the encoded stream and the
introduced distortions. This procedure is summarized in Figure 7. Note that
this procedure introduces a significant amount of computation that increases
with the number N of evaluated "Spatial Video CODECs". In fact, in order to
evaluate the distortion of a single "Spatial Video CODEC" it is necessary to
scale back the quantized coefficients, to perform the inverse transform and to
compute the distortion compared to the input. The rate is obtained first by
performing the entropic coding of the quantized coefficients and then by
counting the resulting bits.
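
The exact measurement can be sketched as below. Several simplifications are assumptions of this sketch, not of the text: the transform is taken as already applied and orthonormal (so the error on the coefficients equals the error on the pixels and the inverse transform is omitted), quantized levels are assumed to fit in 16 bits, and `zlib` stands in for the entropic coder.

```python
import struct
import zlib
from typing import List, Sequence, Tuple

def measure_rate_distortion(coeffs: Sequence[float], step: float) -> Tuple[int, float]:
    """Quantize, entropy-code, scale back, and return (rate in bits, MSE)."""
    levels: List[int] = [int(round(c / step)) for c in coeffs]   # quantization
    payload = struct.pack(f"<{len(levels)}h", *levels)           # 16-bit levels
    rate_bits = 8 * len(zlib.compress(payload, 9))               # stand-in entropic coder
    decoded = [lv * step for lv in levels]                       # scaling back
    mse = sum((c - d) ** 2 for c, d in zip(coeffs, decoded)) / len(coeffs)
    return rate_bits, mse

# Hypothetical transform coefficients for one block.
coeffs = [100.0, -37.5, 12.0, 3.0, -1.0, 0.5, 0.0, 0.0] * 32
rate_coarse, mse_coarse = measure_rate_distortion(coeffs, step=16.0)
rate_fine, mse_fine = measure_rate_distortion(coeffs, step=1.0)
print(rate_coarse, mse_coarse)
print(rate_fine, mse_fine)
```

Every candidate codec would have to run all of these stages for every block, which is the cost the text describes as growing with N.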

In the preferred implementation, an approximate prediction of both R and D is
obtained without the need of performing the Quantization, the Entropic Coder,
the Scaling and the Inverse Transform steps. The prediction can be computed in
a much more computationally efficient way and the introduced approximation does
not affect the correct choice of the best "Spatial Video CODEC".

In the preferred implementation, the rate is estimated as a linear function of
the number of zeros obtained after quantization of the coefficients, while the
distortion is approximated from the distribution of the transformed
coefficients. In particular, before quantization, the histogram of the
transform coefficients is computed. The rate is predicted as a function of the
quantization step Δ:

\[ R = \alpha \sum_{|x_i| \ge \Delta} N_{x_i} \qquad (13) \]

where N_xi is the number of coefficients with an amplitude equal to x_i and the
parameter α is derived from experimental results. Note that in a preferred
implementation Δ is related to Q by equation (2), thus the rate is a simple
function of the quality parameter Q defined by the rate-distortion algorithm.

The distortion is predicted from the distribution of the transformed
coefficients:

\[ D = \sum_{|x_i| < \Delta} x_i^2 \, N_{x_i} + \frac{\Delta^2}{12} \sum_{|x_i| \ge \Delta} N_{x_i} \qquad (14) \]

where x_i is the amplitude of the coefficients and N_xi is the number of
coefficients with an amplitude of x_i. Note that in a preferred implementation
Δ is related to Q by equation (2), thus the distortion is also a simple
function of the quality parameter Q defined by the rate-distortion algorithm.
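
A histogram-based prediction of this kind might look like the sketch below. Three points are assumptions made for illustration and are not confirmed by the text: a coefficient is treated as quantized to zero when its amplitude is below the step, the distortion is normalized per coefficient, and α = 6.0 is a placeholder for the experimentally derived value.

```python
from collections import Counter
from typing import Sequence, Tuple

def predict_rate_distortion(coeffs: Sequence[int], step: float,
                            alpha: float = 6.0) -> Tuple[float, float]:
    """Predict (rate, distortion) from the histogram of transform coefficients.

    Assumptions of this sketch: |x| < step counts as quantized to zero, the
    distortion is averaged over all coefficients, and alpha is a placeholder.
    """
    histogram = Counter(abs(int(c)) for c in coeffs)   # N_xi for each amplitude x_i
    total = sum(histogram.values())
    nonzero = sum(n for x, n in histogram.items() if x >= step)
    zeroed_energy = sum(x * x * n for x, n in histogram.items() if x < step)
    rate = alpha * nonzero
    distortion = (zeroed_energy + nonzero * step ** 2 / 12.0) / total
    return rate, distortion

# Hypothetical integer coefficients for one small block.
coeffs = [40, -12, 5, 3, -2, 1, 0, 0]
print(predict_rate_distortion(coeffs, step=8.0))
print(predict_rate_distortion(coeffs, step=1.0))
```

Only the histogram and the step are needed, so changing Q means re-evaluating two sums rather than re-running quantization and entropic coding.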

The proposed procedure to estimate the rate and the distortion of the "Spatial
Video CODECs" is displayed in Figure 8. Note that in this scheme the displayed
block does not provide the final bitstream B_n, but only the transform
coefficients. To obtain the final bitstream it is necessary to perform a
quantization of the coefficients followed by an entropic coding. The
possibility of predicting the rate and the distortion without performing the
quantization and the entropic coding is an important advantage from a
computational point of view. In fact, these two steps, and in particular the
entropic coder, are computationally expensive. In our implementation all the
"Spatial Video CODECs" are evaluated, but only the one selected requires the
execution of the quantization and of the entropic coding. Compared to Figure 7,
now only the transform is necessary to estimate the distortion and the rate of
a spatial transform. Note that when the best "Spatial Video CODEC" is selected,
it will be necessary to quantize the transform coefficients and to apply the
corresponding entropic coder.
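
The overall flow (transform with every candidate, predict the cost from the coefficients, then quantize and entropy-code only the winner) can be sketched end to end. Everything concrete here is an assumption for illustration: the two toy one-dimensional "codecs", the zero threshold |x| < Δ, the placeholder α = 6.0, and `zlib` as a stand-in entropic coder.

```python
import math
import zlib
from typing import Callable, Dict, List, Sequence, Tuple

Transform = Callable[[Sequence[float]], List[float]]

def identity(block: Sequence[float]) -> List[float]:
    """Trivial 'transform' that leaves the samples unchanged."""
    return list(block)

def haar(block: Sequence[float]) -> List[float]:
    """One level of the orthonormal Haar transform (even-length input)."""
    s = 1.0 / math.sqrt(2.0)
    lows = [(block[i] + block[i + 1]) * s for i in range(0, len(block), 2)]
    highs = [(block[i] - block[i + 1]) * s for i in range(0, len(block), 2)]
    return lows + highs

def predicted_cost(coeffs: Sequence[float], step: float, lam: float,
                   alpha: float = 6.0) -> float:
    """R + lam * D predicted from the coefficients alone (no quantization)."""
    nonzero = sum(1 for c in coeffs if abs(c) >= step)   # assumed zero threshold
    zeroed_energy = sum(c * c for c in coeffs if abs(c) < step)
    rate = alpha * nonzero                               # alpha is a placeholder
    distortion = (zeroed_energy + nonzero * step ** 2 / 12.0) / len(coeffs)
    return rate + lam * distortion

def dynamic_encode(block: Sequence[float], q: float,
                   codecs: Dict[str, Transform]) -> Tuple[str, bytes]:
    """Pick the codec with the lowest predicted cost, then encode with it."""
    step = 2.0 ** (5.0 - q / 24.0)                       # C1 = 5, C2 = 24
    lam = 1.0 / (2.0 * math.log(2.0) * step ** 2 / 12.0)
    coeffs = {name: t(block) for name, t in codecs.items()}      # transform only
    best = min(coeffs, key=lambda name: predicted_cost(coeffs[name], step, lam))
    levels = [int(round(c / step)) for c in coeffs[best]]        # winner only
    bitstream = zlib.compress(",".join(map(str, levels)).encode("ascii"))
    return best, bitstream

codecs = {"identity": identity, "haar": haar}
flat = [100.0] * 8
spike = [100.0] + [0.0] * 7
print(dynamic_encode(flat, 48, codecs)[0], dynamic_encode(spike, 48, codecs)[0])
```

In this toy setting the flat block is assigned to the Haar codec and the isolated spike to the identity codec, a small-scale version of the behaviour described for Figure 9, where no single codec is best for every input.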

In order to illustrate the principle of the dynamic coder, two examples of
performance measurement will be presented. Both examples have been computed
using a frame-based dynamic coder. The first example, shown in Figure 9,
presents the individual performance of three types of "Spatial Video CODEC".
The first one is a wavelet-based CODEC, well adapted to natural scenes, the
second one is a wavelet-based CODEC well adapted to discontinuities, and the
last one is a block-based CODEC well adapted to coding local information.
Depending on the characteristics of the scene, one "Spatial Video CODEC"
outperforms the others. However, it can be seen that the optimal "Spatial Video
CODEC" between frames 450 and 500 becomes sub-optimal in the following frames.
The dynamic coder will always choose the best "Spatial Video CODEC" in terms of
MSE, providing optimum overall performance.

The next example (Figure 10) shows in which proportion each "Spatial Video
CODEC" scheme was chosen in order to compress a typical TV sequence of
approximately 10 minutes at 1000 kbits per second. Again we confirm that there
is not a single "Spatial Video CODEC" that significantly outperforms all other
"Spatial Video CODECs".

The proposed dynamic coder offers several advantages over standard approaches:

1) It guarantees the choice of an optimal encoding with high granularity: each
   2x2 block can be encoded with a different "Spatial Video CODEC".

2) Its complexity is limited thanks to the prediction of the rate and the
   distortion from the transform coefficients.

3) It is an open architecture: it extends the standard approaches to an
   unlimited number of "Spatial Video CODECs".

4) It offers a high degree of adaptation: the best "Spatial Video CODEC" is
   automatically selected according to the input and the target rate.

5) Depending on the application, the complexity may be reduced simply by
   reducing the number of "Spatial Video CODECs" to be evaluated. This
   reduction or increase could be performed at run-time.

6) It allows application-specific "Spatial Video CODECs" to be introduced
   without losing the generality of standard approaches.

The dynamic codec is particularly suitable for all those applications where
high quality video signals have to be encoded in real-time at low bit rates
(< 500 Kbps), such as interactive TV, Corporate TV, TV over ADSL and TV
broadcast, and in particular for all those input signals characterized by the
presence of natural, synthetic and mixed scenes.

CLAIMS

1. Method of selecting among N "Spatial Video CODECs", where N is an integer
number greater than 1, the optimum "Spatial Video CODEC" for a same input
signal I, characterized by the following steps:

- obtaining from all the N "Spatial Video CODECs", for the same input signal I
and a same quality parameter Q, the rate R and the distortion measures D, Q
being an integer value between 0 and 100, defined by any rate-distortion
algorithm to provide a compression of the input sequence with constant rate or
with constant distortion,

- calculating an optimality criterion defined as the minimization of the value
L_n = f(R_n, D_n) calculated for all the n from 1 to N, n being the index of
the "Spatial Video CODEC", where f(R_n, D_n) is a function of R_n and D_n.

2. Method according to claim 1, characterized in that the said function is
defined as f(R_n, D_n) = R_n + λ D_n, λ being the Lagrange multiplier that
weights the relative influence of the rate R and of the distortion D.

3. Method according to claim 1 or 2, characterized in that the input signal I
is a natural image or a predicted image or any rectangular sub-block from a
minimum size of 2x2 of the natural image or of the predicted image.

4. Method according to one of the claims 1 to 3, characterized in that all the
"Spatial Video CODECs" are aligned according to the MSE (Mean Square Error)
and the quality parameter Q according to the following relationship:

\[ \mathrm{MSE} = \frac{f(Q)^2}{12} \]

5. Method according to one of the claims 1 to 4, characterized in that the
Lagrange multiplier λ is defined as

\[ \lambda = \frac{1}{2 \ln(2)\, \mathrm{MSE}} \]

6. Method according to one of the claims 1 to 5, characterized in that the
Spatial CODECs apply a uniform quantization with a step Δ defined as

\[ \Delta = 2^{\,C_1 - Q/C_2} \]

where C1 controls the minimal and maximal quality and C2 the variation of the
distortion according to the quality parameter Q, and where all the "Spatial
Video CODECs" are aligned according to

\[ \mathrm{MSE} = \frac{\Delta^2}{12} = \frac{\left(2^{\,C_1 - Q/C_2}\right)^2}{12} \]

7. Method according to one of the claims 1 to 6, characterized in that the rate
R of the n-th "Spatial Video CODEC" is approximated by

\[ R = \alpha \sum_{|x_i| \ge \Delta} N_{x_i} \]

where N_xi is the number of coefficients with an amplitude equal to x_i and the
parameter α is derived from experimental results.

8. Method according to one of the claims 1 to 7, characterized in that the
distortion D of the n-th "Spatial Video CODEC" is approximated by

\[ D = \sum_{|x_i| < \Delta} x_i^2 \, N_{x_i} + \frac{\Delta^2}{12} \sum_{|x_i| \ge \Delta} N_{x_i} \]

where x_i is the amplitude of the coefficients and N_xi is the number of
coefficients with an amplitude of x_i.

ABSTRACT

The invention relates to a method of selecting among N "Spatial Video CODECs",
where N is an integer number greater than 1, the optimum "Spatial Video CODEC"
for a same input signal I. "Spatial Video CODEC" is understood as the
combination of any transform of the input signal, followed by a quantization of
the transform coefficients and a corresponding entropic coder. Firstly, from
all the N "Spatial Video CODECs", for the same input signal I and a same
quality parameter Q, the rate R and the distortion measures D are obtained. Q
is an integer value between 0 and 100, defined by any rate-distortion algorithm
to provide a compression of the input sequence with constant rate or with
constant distortion. Further, an optimality criterion is calculated. The
optimality criterion is defined as the minimization of the value
L_n = f(R_n, D_n) calculated for all the n from 1 to N. n is the index of the
"Spatial Video CODEC", where f(R_n, D_n) is a function of R_n and D_n. The
function f may be defined as f(R_n, D_n) = R_n + λ D_n, λ being the Lagrange
multiplier that weights the relative influence of the rate R and of the
distortion D.